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APPARATUS AND METHOD FOR PROVIDING VERY LARGE VIRTUAL 
STORAGE VOLUMES USING REDUNDANT ARRAYS OF DISKS 

5 TECHNICAL FIELD OF THE INVENTION 

This invention relates in general to redundant arrays of disks, such as RAID 
(Redundant Array of Independent Disks) sets. More specifically, the invention relates 
to an apparatus and method for providing virtual storage volumes, particularly very 
large virtual storage volumes (e.g., 100 Gigabytes (GB) or more), using redundant 
10 arrays of disks, such as RAID sets. 

BACKGROUND OF THE INVENTION 
Some computer software applications are organized according to what is 
referred to as a "single volume architecture," meaning that they store data in a single 

15 data file that resides on a single volume. This "volume" may be a physical volume, 
such as a disk drive, or it may be a virtual volume, such as a RAID set. The 
Exchange® e-mail program provided by Microsoft Corporation of Redmond, 
Washington is an example of such a single-volume-architecture application. 

In some cases, the single volume architecture of a particular software 

20 application can be problematic, because the size of the data file the application needs 
to store on a single volume exceeds the capacity of the volume. For example, 
implementations of Microsoft's Exchange® e-mail program in large organizations 
having many e-mail users can require a single-volume storage capacity exceeding 
100GB, which is greater than many conventional volumes, physical or virtual, can 

25 provide. Although it is possible to solve this problem by changing a single-volume- 
architecture application into a multi-volume-architecture application so that it saves 
data in multiple files spread across multiple volumes, such efforts can be prohibitively 
time-consuming and expensive. 

Accordingly, there is a need in the art for a very large virtual storage volume 

30 having the storage capacity necessary to meet the needs of a single-volume- 
architecture software application, such as Microsoft's Exchange® e-mail program. 
Preferably, such a storage volume should have built-in disaster tolerance capabilities 
through the use of remote mirroring or other techniques in order to ensure the integrity 
of its stored data. In addition, such a storage volume should preferably have cloning 
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capabilities so that data backup can occur off-line without interrupting read/write 
access to the data. 

SUMMARY OF THE INVENTION 
5 An inventive apparatus for providing a very large storage volume includes a 

plurality of disks and a local back-end controller that organizes and presents the disks 
as redundant arrays of disks (e.g., RAID-5 sets). Also, a local front-end controller 
stripes the redundant arrays of disks and presents the striped arrays as a very large 
storage volume. 

10 To provide local redundancy, another plurality of disks and an associated 

back-end controller can be provided, in which case the local front-end controller 
forms mirror sets from the redundant arrays of disks presented by both back-end 
controllers. In addition, a further plurality of disks and an associated back-end 
controller can be provided to enable off-line backup of the data stored on the volume 

15 by cloning the data onto the disks, and then using the disks as the data source for off- 
line backup. Also, a still further plurality of disks and an associated back-end 
controller can be provided at a remote location to protect against disasters occurring at 
the primary location (commonly referred to as "disaster tolerance"). The disks and 
back-end controllers providing cloning capabilities and disaster tolerance can be 

20 incorporated into the mirror sets formed by the local front-end controller. Further, 
spare disks can be provided on any or all of the back-end controllers to allow 
restoration of redundancy after the loss of any particular disk. 

If, for example, the disks each have 9.1GB of storage capacity and the local 
back-end controller organizes the disks into eleven, six-member RAID-5 sets, then the 

25 very large storage volume has a storage capacity in excess of 500GB, which should be 
adequate for most single-volume architecture programs. In addition, the redundancy 
restoration capabilities provided by the spare disks, the parity associated with RAID-5 
sets, and the mirroring ensures the integrity of the data stored on the very large storage 
volume. 

30 In another embodiment of this invention, the apparatus described above can be 

incorporated into an electronic system that also includes a host computer. 

In a further embodiment of this invention, data is stored on a plurality of disks 
by organizing the disks into a plurality of redundant arrays of disks. The redundant 




arrays of disks are striped together to form a virtual volume, and the data is then 
written to the virtual volume. 

In still another embodiment of this invention, data is again stored on a 
plurality of disks by organizing the disks into a plurality of redundant arrays of disks. 
5 Mirror sets are formed from the redundant arrays of disks, and these mirror sets are 
then striped together to form a virtual volume. The data is then written to the virtual 
volume. 

BRIEF DESCRIPTION OF THE FIGURES 
10 Figures 1A and IB is a diagram illustrating the organization of a very large 

volume constructed in accordance with this invention; and 

Figure 2 is a block diagram illustrating the very large volume of Figures 1 A 
and IB. 

15 DETAILED DESCRIPTION OF THE ILLUSTRATED EMBODIMENTS 

As shown in Figures 1 A and IB, a 500.5GB very large volume 10 constructed 
in accordance with this invention is organized so as to comprise a RAID-0 stripe set 
having eleven, 45.5GB RAID-1 mirror sets Ml-Mll as members. Of course, it will 
be understood by those having skill in the technical field of this invention that 

20 although the invention will be described with respect to a very large volume having a 
500.5GB storage capacity, the invention is not limited to any particular storage 
capacity. In addition, it will be understood that the invention is not limited to the use 
of any particular redundant array technology (e.g., RAID) and, consequently, is not 
limited to the use of any particular RAID levels (e.g., RAID-0, RAID-1). Also, it will 

25 be understood that the invention may include more or less than the eleven mirror sets 
Ml-Mll, and that the individual mirror sets Ml-Mll may be larger orsmaller in size 
than the 45.5GB described here. \ 

As used herein, a "RAID-0 stripe set" will be understood to refer to a virtual 
volume comprised of two or more member disks or volumes across which "stripes" of 

30 data are stored. Also, as used herein, a "RAID-1 mirror set" will be understood to 
refer to a virtual volume comprised of two or more member disks or volumes, each of 
which contains an identical copy of the data stored in the mirror set. 
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The mirror set Ml, for example, comprises five, 45.5GB RAID-5 sets PL1, 
RL1, CI, PR1, and RR1 as members. Similarly, the mirror set Mil comprises five, 
45.5GB RAID-5 sets PL11, RL11, Cll, PR11, and RR11 as members. For purposes 
of clarity, the RAID-5 set members of the mirror sets M2-M10 are illustrated but not 
5 labeled. 

Of course, it will be understood that the members of the mirror sets Ml-Mll 
can be other than RAID-5 sets (e.g., RAID-3 or RAID-4 sets). Also, as used herein, a 
"RAID-5 set" will be understood to refer to a virtual volume comprised of three or 
more independently accessible member disks or volumes having redundancy 

10 protection through parity information distributed across its members. 

The RAID-5 sets PL1-PL11 comprise the primary local storage copy of the 
data stored in the very large volume 10, which means that they are the primary 
location to which the data is written and from which the data is read. Also, the RAID- 
5 sets RL1-RL11 comprise a redundant local storage copy of the data, which provides 

15 mirroring-type redundancy for the stored data. In addition, the RAID-5 sets Cl-Cll 
comprise a cloning storage copy of the data, which is convenient for use in 
performing off-line data backups without interrupting read/write activities to the very 
large volume 10. Disaster tolerance is provided by the RAID-5 sets PR1-PR11, 
which comprise a primary remote storage copy, and the RAID-5 sets RR1-RR11, 

20 which comprise a redundant remote storage copy. Of course, it should be understood 
that embodiments of this invention may exclude the redundancy provided by the 
RAID-5 sets RL1-RL11, the cloning capability provided by the RAID-5 sets Cl-Cll, 
or the disaster tolerance provided by the RAID-5 sets PR1-PR11 and RR1-RR11. 

The RAID-5 set PL1, for example, comprises six, 9.1GB physical disks 12 

25 distributed across six SCSI busses busl-bus6 of a back-end controller (see Figure 2). 
Similarly, the RAID-5 set PL11 comprises six, 9.1GB physical disks 14 distributed 
across the six SCSI busses busl-bus6. In addition, six, 9.1GB spare physical disks 16 
seamlessly replace any failing disks on any of the busses busl-bus6 by rebuilding the 
data stored on failing disks from parity data, thereby restoring redundancy after a disk 

30 failure. 

As described herein, the very large volume 10 has a high degree of 
redundancy. If the bus3 physical disk 12 fails, for example, it is replaced by the bus3 
spare disk 16 by using parity data to rebuild the data stored on the failing bus3 




physical disk 12 onto the replacement bus3 spare disk 16. If bus3 itself fails, for 
example, the parity redundancy in the RAID-5 sets PL1-PL11 regenerates the data 
stored on the failing bus3. If the back-end controller (see Figure 2) associated with 
the RAID-5 sets PL1-PL11 fails, for example, the redundant local storage copy, 
5 comprised of the RAID-5 sets RL1-RL11, provides redundancy. Finally, if the front- 
end controller (see Figure 2) associated with the primary and redundant local storage 
copies and the cloning storage copy fails or is destroyed (e.g., due to a disaster), the 
primary remote storage copy, comprised of the RAID-5 sets PR1-PR11, and the 
redundant remote storage copy, comprised of the RAID-5 sets RR1-RR11, provide 
10 redundancy. 

As shown in a block diagram in Figure 2, the very large volume 10 is 
connected to a local host computer 20 that reads data from, and writes data to, the 
volume 10 via a local front-end controller 22 that acts as a mirroring and striping 
engine. In other words, the controller 22 forms the mirror sets Ml-Mll (see Figures 

15 1A and IB) and then stripes them so as to present them to the local host computer 20 
as the very large volume 10. 

The primary local storage copy comprises physical disks 24 (which include 
disks 12, 14, and 16 of Figures 1 A and IB) connected to a back-end controller 26. 
The controller 26 acts as a RAID-5 engine by forming the disks 24 into the RAID-5 

20 sets PL1-PL11 (see Figures 1A and IB) and presenting the sets PL1-PL11 to the 
front-end controller 22 as members. Similarly, the redundant local storage, clone, 
primary remote storage, and redundant remote storage copies comprise physical disks 
28, 30, 32, and 34, respectively, connected to back-end controllers 36, 38, 40, and 42, 
respectively, that act as RAID-5 engines by forming the disks 28, 30, 32, and 34 into 

25 the RAID-5 sets RL1-RL11, Cl-Cll, PR1-PR11, and RR1-RR11 and presenting 
these sets to front-end controller 22 as members. 

In addition, the very large volume 10 is connected to a remote host 
computer 44 that reads data from, and writes data to, the volume 10 via a remote 
front-end controller 46 that acts as a mirroring and striping engine for the primary and 

30 redundant remote storage copies. The local and remote host computers 20 and 44 are 
connected via a network interconnect 48, such as the internet or a dedicated network 
line. 



5 



In an alternative embodiment, the front-end controller 22 can be configured to 
present a 273GB unit and a 227.5GB unit to the local host computer 20, rather than 
the single 500.5GB unit described above. Of course, it should be understood that the 
front-end controller 22 can, in fact, be configured in a multitude of ways to group the 
5 mirror sets Ml-11 into between one and eleven total units, each potentially ranging in 
size from 45.5GB to 500.5GB. Further, it should be understood that the front-end 
controller 22 can be configured (typically using software) to partition the striped 
mirror sets Ml-11 into an infinite number and size of units. 

In addition, it should be understood that although this invention has been 
1 0 described with reference to an embodiment having two levels of stacked controllers, 
the invention is not limited to the two levels described. Rather, the invention includes 
within its scope any number of levels of stacked controllers. 

Although this invention has been described with reference to particular 
embodiments, the invention is not limited to these described embodiments. Rather, 
15 the invention is limited only by the appended claims, which include within their scope 
all equivalent devices and methods that operate according to the principles of the 
invention as described. 
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